Accession Number |
TCMCG009C29146 |
gbkey |
CDS |
Protein Id |
XP_030485875.1 |
Location |
join(68085..68262,68341..68388,68468..68622,68703..68760,69086..69143,69296..69388,69462..69624,69720..69735,69838..70058,70250..70373,70524..71876,71958..72012,72105..72214,72307..72362,72479..72682,72808..72852,72938..72991,73072..73178,73257..73305,73425..73484,73571..73643,73750..73835,73919..74092) |
Gene |
LOC115702602 |
GeneID |
115702602 |
Organism |
Cannabis sativa |
|
|
Length |
1179 aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA560384 |
db_source |
XM_030630015.1 |
Definition |
DNA mismatch repair protein MLH3 isoform X1 [Cannabis sativa] |
CDS: ATGGCCACCATCCAATCATTGCCCCAACCTCTTCGTACTTCACTGCGCTCTGGGATCATTCTTTTTGACATTGCCACTGTGGTCGAAGAGCTCGTTTTCAACAGCCTCGACGCTGCTGCTTCAACGGTATCAGTTTTTGTAGGCGTTGGGAGTTCTTATGTCAAAGTGGTAGATGATGGATCAGGCATTCCGAGAGATGATTTGGTGCTTTTGGGAGAAAGATATGCAACATCGAAGTTTGATCATTTAGCTGATAAGGATAATGAATGTAAGAGCTTTGGGTTTCGAGGGGAAGCATTGGCTTCAATTTCCGATGTATCCTTGTTAGAAGTTGTGACAAAAGCTTCTGGGAGGCCTAATGGATACCGCAAAGTTATGAAGGGAAGCAAATGTTTGTATCTCGGAATATATGATGATAGGAAGGATGTGGGAACGACAGTTTCTGTTCGGGATTTATTTTACAACCAACCAATTCGGAGGAAGTATATTCAATCAAGCCCTAAGAAGGTCTTGCAATCGATCAAGAAATGCGTACAGAGAATTTCCCTTGTGCACTCAAATGTATCTTTTAAAGTTGTTGATATAGAAAGTGAGGATGTACTTCTCTACACACTACCTTCTTCTCCCATGTCCCTATTGACCACTTGTTTTGGGACCGAGATCTCCACCTCTCTTCATGAATTGAAAAGTAGTAGCGGAAAAATTGAGCTTTCTGGATACATATCTTCCCCTTGTGATAATTTGAGCATCAAGGTCTTTCAATATATCTATATTAATTCACGGTATGTGTGCAAAGGGCCAATCCATAAATTGGTGAATCAGTTGGCAACTAGATATCATTGGTTCGATCAAGGAAAAGCTGTCAATAATTTCCAAACCAGAAAACGAAGGAGAGCTCAAATATACCCTGCTTTCATTTTGAACATAAGGTGCCCTCGCTCCTTTTATGATTTAAACTTTGAGCCATCAAAAACCTATGTAGAATTCAAGGATTGGGTTCCTGTACTTGCACTTCTTGAAGAGGTCATTCAAGACTTGTGGAAGGAAAATATGTCTGATATCAAGGGCGAAGATCTACTGTTGAACGGAGATGGAAACACCATATCTTTAGGAGATTTACATGATGTTTTCCCAAGAAACTCCACAACTGGGAATAAGAAGGGCAGAATCCAAAATTCTGAGGACTCTCTTGACCCTATTTCTTCTCATCTGAAGATGCCTATCAAAGAGCTCAACTACATGTCCCAGAGGAAACAGGATATGATTACAAATAAAAGCTCACAAAAATGTACTCATTACTTCGATGATCAAGAAGATGAGATGGATACATTTCATACTGACAATTCTCTTCAAGCATGGGATCAGCCCCTTGCCCAATGCAAGCTCAAAGTATCTAAAAACTACGAGCACCAGCCGTTCATTTCTAATAACCACTCTTTATCTACAGATGATTATTGCCTGGTAGATGAATATACTTCTGGAGAGAGATGCAGAGTGAGTGCAGATGTTAACTTCAGTTCTTCGTGGGGAAGAGATTCGTTTGAAGCTGATCTCGGTCTTAGCAGTGAAGCTGTTGATAGTTCATTGTGTTGTGATCATCAAGAACTCAGCAATGATGTAGCAGTGAGCAGGGATGGGAAGAGACCTTTTCTGAAGAGTTGCTCCTTCAGAGGGTGCCTTCCACAAGAGAGGACTTTGTGTACAGATGATGTTTGTAAGTTTAAAAGTGATAACTTTAAGATCAACCAAATGTTGAACTGGCCTTATGGCGGGGTTAATGTTTCAGAGACTAACGAGAATTTTGATTTCTTGTCATGTCCTTTGCGGGAGGCCAACACATCAAGATTTCAGTCCTCTCCAACTGCAGAAATCGGTAGTTTTTCAAGATCTTCCCTAAGGTCATTCCCACTTTATGAAGAGCCTACCACTGACTATAATGATGGTTTCTTGTCTGATTCGGTTAAAAGGATAGAGACTATTGGCTCTAATCATCTC
AATTTGGACCCTGAATGGTGCTCTTTGAGCTTAGATTCGATTTCCCAGCCCACACCTTGGGATGTAGATCATTACACTGACTGCAGATATAGTAATGATGTACTGGAAAAGAGATCTAGCCAAGAAAAGTGGAATGCAAGCTGTTCATACAATGAATTTTCAGATGCGGACTTGGGTGAATTCCTTCCTAGGCACAATTTAAACAAAAGACTTCCCTCTAAGTCGGTGAACATATCAAATCATGGGACAGACTGGTTAAGTGAAGTATCTCTTGGTAAAAACCAGTTGAGTAGTGAGATGTACAAAAGTCAAAGAGATCAAAATAGTTATGATGAAAGTGAAGGGAATGGTCATTTCTCTAAACGAAGATCAAGAAGCCACTCAGCTCCGCCATTTTGTAGAAGCAAGAGAAAGTTCTTGACCTTAAACTTCCATTCTACAGGAAAAGGGGGCAATGACCCAGCTTATCCAGAAGGTCGTGAACGGTGGAAGCCTATAGTTTTGGGGGATTCGCTGTTAGACAACAGGCTAGATTGGAAGAATCTACAAGACCTGAAGGAAGACTTAACGGAGATTAGAAGTGAGGAAAGGCTTGAACAGTCTGTTTGTTTCGACATTCAAGATGCTCCATTTAAAGATAATGTTTCCCTGAATTGTGGGAGCAAATGGCGGAACAGCTGCCCAAAGACTGCTCACAATAATAAGTTGCATCACCATGACATCCAGAATGAGAGTAGTATTCTCGATATCTCTTCGGGATTCTTGCACCTTGCTGGTGACTCTTTAGTTCCTGAATCTATGAACAAAAACAGCCTTAAGGAAGCCATTGTTCTCCAACAGATTGATAAAAAATACATTCCAATTGTGGCTGGGAAAACTCTTGCTGTCGTTGATCAGCACGCTGCAGATGAACGAATCCGACTAGAGGAGTTGCGTCAGAAGGTCTTGTCTGGAGAAGCAAAGGAAATCACTTTTCTGGATGCAGAGAAAGAATTGATGCTGCCAGAGATTGGGCATCAGTTACTGCATAGCTATGCCAAAGAAATTAAAGAATGGGGTTGGATATGTAACATTCATGCTCAGGATTCAAAATCCTTCAAAAGGAATTTGAATCTTCTCCACAACCGGCCAACGGTTATTAAGCTCGTTGCGGTACCATGCATCTTAGGAGTCAACTTGTCTGATATAGATCTCACTGAATTTCTCCAACAGCTGGCTGATACCGATGGATCTTCAACAATACCACCCTCTGTTCTTCGCGTCCTCAATTCCAAAGCATGTCGAGGTGCAATTATGTTTGGGGATGCATTGCTACATTCGGAGTGTTCCCTTATCATTGAAGAGCTAAAGCATACCTCCCTTTGTTTTCAGTGTGCACATGGGCGGCCGACAACTGCCCCGCTCGTGAACTTGGAGACACTGCATAAGCAAATAGCTAAGACTACACTGCACACTGATGATTCCAACGGGTTATGGCATGGCTTACGCCGACATGAGCTCAATGTTGAACGCGCTGAACAACGTTTGATGTCAGCATCTTGTTAG |
Protein: MATIQSLPQPLRTSLRSGIILFDIATVVEELVFNSLDAAASTVSVFVGVGSSYVKVVDDGSGIPRDDLVLLGERYATSKFDHLADKDNECKSFGFRGEALASISDVSLLEVVTKASGRPNGYRKVMKGSKCLYLGIYDDRKDVGTTVSVRDLFYNQPIRRKYIQSSPKKVLQSIKKCVQRISLVHSNVSFKVVDIESEDVLLYTLPSSPMSLLTTCFGTEISTSLHELKSSSGKIELSGYISSPCDNLSIKVFQYIYINSRYVCKGPIHKLVNQLATRYHWFDQGKAVNNFQTRKRRRAQIYPAFILNIRCPRSFYDLNFEPSKTYVEFKDWVPVLALLEEVIQDLWKENMSDIKGEDLLLNGDGNTISLGDLHDVFPRNSTTGNKKGRIQNSEDSLDPISSHLKMPIKELNYMSQRKQDMITNKSSQKCTHYFDDQEDEMDTFHTDNSLQAWDQPLAQCKLKVSKNYEHQPFISNNHSLSTDDYCLVDEYTSGERCRVSADVNFSSSWGRDSFEADLGLSSEAVDSSLCCDHQELSNDVAVSRDGKRPFLKSCSFRGCLPQERTLCTDDVCKFKSDNFKINQMLNWPYGGVNVSETNENFDFLSCPLREANTSRFQSSPTAEIGSFSRSSLRSFPLYEEPTTDYNDGFLSDSVKRIETIGSNHLNLDPEWCSLSLDSISQPTPWDVDHYTDCRYSNDVLEKRSSQEKWNASCSYNEFSDADLGEFLPRHNLNKRLPSKSVNISNHGTDWLSEVSLGKNQLSSEMYKSQRDQNSYDESEGNGHFSKRRSRSHSAPPFCRSKRKFLTLNFHSTGKGGNDPAYPEGRERWKPIVLGDSLLDNRLDWKNLQDLKEDLTEIRSEERLEQSVCFDIQDAPFKDNVSLNCGSKWRNSCPKTAHNNKLHHHDIQNESSILDISSGFLHLAGDSLVPESMNKNSLKEAIVLQQIDKKYIPIVAGKTLAVVDQHAADERIRLEELRQKVLSGEAKEITFLDAEKELMLPEIGHQLLHSYAKEIKEWGWICNIHAQDSKSFKRNLNLLHNRPTVIKLVAVPCILGVNLSDIDLTEFLQQLADTDGSSTIPPSVLRVLNSKACRGAIMFGDALLHSECSLIIEELKHTSLCFQCAHGRPTTAPLVNLETLHKQIAKTTLHTDDSNGLWHGLRRHELNVERAEQRLMSASC |
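The Location, Length, and CDS fields of this record can be cross-checked against each other. The sketch below is a minimal illustration, not part of the database entry: it parses the `join(...)` location string, sums the exon segment lengths, and compares the total with the nucleotide count implied by the 1179 aa protein plus one stop codon. The helper names `parse_join` and `spliced_length` are hypothetical. It assumes a forward-strand location with no `complement(...)` wrapper and no partial-end markers (`<`, `>`), which holds for the location string shown in this record.

```python
import re

# Location string copied from the record above (forward strand, 1-based inclusive).
LOCATION = (
    "join(68085..68262,68341..68388,68468..68622,68703..68760,69086..69143,"
    "69296..69388,69462..69624,69720..69735,69838..70058,70250..70373,"
    "70524..71876,71958..72012,72105..72214,72307..72362,72479..72682,"
    "72808..72852,72938..72991,73072..73178,73257..73305,73425..73484,"
    "73571..73643,73750..73835,73919..74092)"
)

PROTEIN_LENGTH_AA = 1179  # from the Length field


def parse_join(location):
    """Return a list of (start, end) integer pairs from a simple join(...) string.

    Illustrative helper: handles only plain 'start..end' segments.
    """
    return [(int(s), int(e)) for s, e in re.findall(r"(\d+)\.\.(\d+)", location)]


def spliced_length(segments):
    """Total nucleotides covered by the segments (coordinates are inclusive)."""
    return sum(end - start + 1 for start, end in segments)


segments = parse_join(LOCATION)
cds_nt = spliced_length(segments)

# A complete CDS encodes the protein plus one stop codon.
expected_nt = (PROTEIN_LENGTH_AA + 1) * 3

print(len(segments), cds_nt, cds_nt == expected_nt)
# prints: 23 3540 True
```

The same total (3540 nt) should match the length of the CDS sequence field once any line break inside it is removed, which gives a quick integrity check when copying the sequence out of the record.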